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Abstract 

We present a unified approach to proving Ramsey-type theorems for graphs with a forbidden 
induced subgraph which can be used to extend and improve the earher results of Rodl, Erdos- 
Hajnal, Promel-Rodl, Nikiforov, Chung-Graham, and Luczak-RodL The proofs are based on a 
simple lemma (generalizing one by Graham, Rodl, and Rucihski) that can be used as a replacement 
for Szemeredi's regularity lemma, thereby giving much better bounds. The same approach can be 
also used to show that pseudo-random graphs have strong induced Ramsey properties. This leads 
to explicit constructions for upper bounds on various induced Ramsey numbers. 

1 Background and Introduction 

Ramsey theory refers to a large body of deep results in mathematics concerning partitions of large 
structures. Its underlying philosophy is captured succinctly by the statement that "In a large system, 
complete disorder is impossible." This is an area in which a great variety of techniques from many 
branches of mathematics are used and whose results are important not only to graph theory and 
combinatorics but also to logic, analysis, number theory, and geometry. Since the publication of the 
seminal paper of Ramsey [43] in 1930, this subject has grown with increasing vitality and is currently 
among the most active areas in combinatorics. 

For a graph H, the Ramsey number r{H) is the least positive integer n such that every two-coloring 
of the edges of the complete graph Kn on n vertices contains a monochromatic copy of H. Ramsey's 
theorem states that r{H) exists for every graph H. A classical result of Erdos and Szekeres [26], which 
is a quantitative version of Ramsey's theorem, implies that r{Kk) < for every positive integer k. 
Erdos [19] showed using probabilistic arguments that r(Kk) > 2^^/^ for k > 2. Over the last sixty 
years, there has been several improvements on the lower and upper bounds of r(i^fc), the most recent 
by Conlon [T2]. However, despite efforts by various researchers, the constant factors in the exponents 
of these bounds remain the same. 

A subset of vertices of a graph is homogeneous if it is either an independent set (empty subgraph) 
or a clique (complete subgraph). For a graph G, denote by hom(G) the size of the largest homogeneous 
subset of vertices of G. A restatement of the Erdos-Szekeres result is that every graph G on n vertices 
satisfies hom(G) > |logn, while the Erdos result says that for each n > 2 there is a graph G on n 
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vertices with hom(G) < 21ogn. (Here, and throughout the paper, all logarithms are base 2.) The 
only known proofs of the existence of Ramsey graphs, i.e., graphs for which hom(G) = O(logn), come 
from various models of random graphs with edge density bounded away from and 1. This supports 
the belief that any graph with small hom(G') looks 'random' in one sense or another. There are now 
several results which show that Ramsey graphs have random-like properties. 

A graph H is an induced subgraph of a graph G if V{H) C V{G) and two vertices of H are adjacent 
if and only if they are adjacent in G. A graph is k-universal if it contains all graphs on at most k 
vertices as induced subgraphs. A basic property of large random graphs is that they almost surely are 
A;-universal. There is a general belief that graphs which are not fe-universal are highly structured. In 
particular, they should contain a homogeneous subset which is much larger than that guaranteed by 
the Erdos-Szekeres bound for general graphs. 

In the early 1970's, an important generalization of Ramsey's theorem, known as the Induced 
Ramsey Theorem, was discovered independently by Deuber y/6j, Erdos, Hajnal, and Posa [25], and 
Rodl [44]. It states that for every graph H there is a graph G such that in every 2-edge-coloring of G 
there is an induced copy of H whose edges are monochromatic. The least positive integer n for which 
there is an n- vertex graph with this property is called the induced Ramsey number r[^c^{H). All of 
the early proofs of the Induced Ramsey Theorem give enormous upper bounds on rind(-ff)- It is still 
a major open problem to prove good bounds on induced Ramsey numbers. Ideally, we would like to 
understand conditions for a graph G to have the property that in every two-coloring of the edges of 
G, there is an induced copy of graph H that is monochromatic. 

In this paper, we present a unified approach to proving Ramsey- type theorems for graphs with a 
forbidden induced subgraph which can be used to extend and improve results of various researchers. 
The same approach is also used to prove new bounds on induced Ramsey numbers. In the few 
subsequent sections we present in full detail our theorems and compare them with previously obtained 
results. 



1.1 Ramsey properties of H-free graphs 

As we already mentioned, there are several results (see, e.g., [STJ HTJ [21 [10]) which indicate that 
Ramsey graphs, graphs G with relatively small hom(G), have random-like properties. The first advance 
in this area was made by Erdos and Szemeredi [2^, who showed that the Erdos-Szekeres bound 
hom(G) > ^ log n can be improved for graphs which are very sparse or very dense. The edge density 
of a graph G is the fraction of pairs of distinct vertices of G that are edges. The Erdos-Szemeredi 
theorem states that there is an absolute positive constant c such that hom(G) > for every graph 

G on n vertices with edge density e S (0, 1/2). This result shows that the Erdos-Szekeres bound can 
be significantly improved for graphs that contain a large subset of vertices that is very sparse or very 
dense. 

Rodl [45j proved that if a graph is not fc-universal with k fixed, then it contains a linear-sized 
induced subgraph that is very sparse or very dense. A graph is called H-free if it does not contain H 
as an induced subgraph. More precisely, Rodl's theorem says that for each graph H and e E (0, 1/2), 
there is a positive constant 5(e, H) such that every H-fiee graph on n vertices contains an induced 
subgraph on at least S{e,H)n vertices with edge density either at most e or at least 1 — e. Together 
with the theorem of Erdos and Szemeredi, it shows that the Erdos-Szekeres bound can be improved 
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by any constant factor for any family of graphs that have a forbidden induced subgraph. 

Rodl's proof uses Szemeredi's regularity lemma [48], a powerful tool in graph theory, which was 
introduced by Szemeredi in his celebrated proof of the Erdos-Turan conjecture on long arithmetic 
progressions in dense subsets of the integers. The regularity lemma roughly says that every large 
graph can be partitioned into a small number of parts such that the bipartite subgraph between 
almost every pair of parts is random-like. To properly state the regularity lemma requires some 
terminology. The edge density d{X, Y) between two subsets of vertices of a graph G is the fraction of 
pairs {x,y) € X xY that are edges of G, i.e., d{X, Y) = ^\x\'\Y\ ' "^^ere e{X, Y) is the number of edges 
with one endpoint in X and the other in y. A pair (X, Y) of vertex sets is called e-regular if for every 
X' C X and Y' C Y with \X'\ > e\X\ and > e\Y\ we have \d{X',Y') - d{X,Y)\ < e. A partition 
V = IJi=i ^ is called equitable if ||Vi| — \ Vj\\ < 1 for all 

Szemeredi's regularity lemma 08| states that for each e > 0, there is a positive integer M(e) such 
that the vertices of any graph G can be equitably partitioned V{G) = Ui=i ^ i^ito k subsets with 
< k < M(e) satisfying that all but at most ek'^ of the pairs {Vi,Vj) are e-regular. For more 
background on the regularity lemma, see the excellent survey by Komlos and Simonovits |37j . 

In the regularity lemma, M(e) can be taken to be a tower of 2's of height proportional to e~^. On 
the other hand, Gowers [31] proved a lower bound on M(e) which is a tower of 2's of height proportional 
to e~T6. His result demonstrates that M(e) is inherently large as a function of e~^. Unfortunately, 
this implies that the bounds obtained by applications of the regularity lemma are often quite poor. 
In particular, this is a weakness of the bound on (5(e, H) given by Rodl's proof of his theorem. It is 
therefore desirable to find a new proof of Rodl's theorem that does not use the regularity lemma. The 
following theorem does just that, giving a much better bound on 5{e, H). Its proof works as well in a 
multicolor setting (see concluding remarks). 

Theorem 1.1 There is a constant c such that for each e £ (0, 1/2) and graph H on k > 2 vertices, 
every H-free graph on n vertices contains an induced subgraph on at least 2"'^'^^'°^ "•'^n vertices with 
edge density either at most e or at least 1 — e. 

Nikiforov [IT] recently strengthened Rodl's theorem by proving that for each e > and graph H of 
order k, there are positive constants k = K{e, H) and G = C(e, H) such that for every graph G = {V, E) 
that contains at most kIFI'^ induced copies of there is an equitable partition V = IJ^j of the 
vertex set such that the edge density in each (i > 1) is at most e or at least 1 — e. Using the same 
technique as the proof of Theorem 11.11 we give a new proof of this result without using the regularity 
lemma, thereby solving the main open problem posed in [41j . 

Erdos and Hajnal [23] gave a significant improvement on the Erdos-Szekeres bound on the size 
of the largest homogeneous set in H-hee graphs. They proved that for every graph H there is a 
positive constant c{H) such that hom(G) > 2^^^^^^^^^ for all H-fiee graphs G on n vertices. Erdos 
and Hajnal further conjectured that every such G contains a complete or empty subgraph of order 
rf-W^ This beautiful problem has received increasing attention by various researchers, and was also 
featured by Gowers [32] in his list of problems at the turn of the century. For various partial results 
on the Erdos-Hajnal conjecture see, e.g., [H [24l |29l [Sj |28l [Ml US] and their references. 

Recall that a graph is /c-universal if it contains all graphs on at most k vertices as induced subgraphs. 
Note that the Erdos-Hajnal bound, in particular, implies that, for every fixed k, sufficiently large 
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Ramsey graphs are /c-universal. This was extended further by Promel and Rodl [l2], who obtained an 
asymptoticaUy best possible result. They proved that if hom(G) < ci logn then G is C2 log n-universal 
for some constant C2 which depends on ci. 

Let hom(n, k) be the largest positive integer such that every graph G on n vertices is /c-universal 
or satisfies hom(G) > hom(n, A;). The Erdos-Hajnal theorem and the Promel- Rodl theorem both say 
that hom(n, k) is large for fixed or slowly growing k. Indeed, from the first theorem it follows that for 
fixed k there is c{k.) > such that hom(n, k) > 2^('^')v^'°s", while the second theorem says that for each 
ci there is C2 > such that hom(n, C2 log n) > ci log n. One would naturally like to have a general 
lower bound on hom(n, k) that implies both the Erdos-Hajnal and Promel-Rodl results. This is done 
in the following theorem. 

Theorem 1.2 There are positive constants C3 and C4 such that for all n, k, every graph on n vertices 

/ log 71 

is k-universal or satisfies hom(G) > 0^2 "'V fc logn. 

Theorem 1 1 . 1 1 can be also used to answer a question of Chung and Graham [13j . which was motivated 
by the study of quasirandom graphs. Given a fixed graph H, it is well known that a typical graph on n 
vertices contains many induced copies of if as n becomes large. Therefore if a large graph G contains 
no induced copy of H, its edge distribution should deviate from "typical" in a rather strong way. This 
intuition was made rigorous in [13], where the authors proved that if a graph G on n vertices is not 
fc-universal, then there is a subset S of [^J vertices of G such that \e{S) — j^ri'^l > 2~'^^^^~^^'^^n'^ . For 
positive integers k and n, let D(k, n) denote the largest integer such that every graph G on n vertices 
that is not /c-universal contains a subset S of vertices of size [^J with |e(5) — j^n"^] > D{k,n). Chung 
and Graham asked whether their lower bound on D(k, n) can be substantially improved, e.g., replaced 
by c~^n'^. Using Theorem 1 1 . 1 1 this can be easily done as follows. 

A lemma of Erdos, Goldberg, Pach, and Spencer [22] implies that if a graph on n vertices has a 
subset R that deviates by D edges from having edge density 1/2, then there is a subset S of size [n/2\ 
that deviates by at least a constant times D edges from having edge density 1/2. By Theorem 11.11 
with e = 1/4, there is a positive constant G such that every graph on n vertices that is not fc-universal 
has a subset R of size at least G~^n with edge density at most 1/4 or at least 3/4. This R deviates 
from having edge density 1/2 by at least 

4 V 2 y - 16' ' - 16 

edges. Thus, the above mentioned lemma from [22] implies that there is an absolute constant c such 
that every graph G on n vertices which is not A;- universal contains a subset S of size \n/2\ with 
\e{S) — Ygl > c~^n'^. Chung and Graham also ask for non-trivial upper bounds on D{k,n). In this 
direction, we show that there are -fCfc-free graphs on n vertices for which \e{S) — = 0(2~'^/^n^) 

holds for every subset S of [^J vertices of G. Together with the lower bound it determines the 
asymptotic behavior of D{k,n) and shows that there are constants ci,C2 > 1 such that c^^n'^ < 
D{k,n) < c'^^n^ holds for all positive integers k and n. This completely answers the questions of 
Chung and Graham. 

Moreover, we can obtain a more precise result about the relation between the number of induced 
copies of a fixed graph H m & large graph G and the edge distribution of G. In their celebrated paper. 
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Chung, Graham, and Wilson [H] introduced a large collection of equivalent graph properties shared 
by almost all graphs which are called quasirandom. For a graph G = (V, E) on n vertices, two of these 
properties are 

Pi : For each subset S* C F, 

e[S) = \\S\^ + o{n'). 

P2: For every fixed graph H with k vertices, the number of labeled induced copies of in G is 

(l + o(l))n'=2-(2). 

So one can ask naturally, by how much does a graph deviate from Pi assuming a deviation from P2? 
The following theorem answers this question. 

Theorem 1.3 Let H be a graph with k vertices and G = iV^E) he a graph with n vertices and at 
most (1 — e)2~^'^n^ labeled induced copies of H . Then there is a subset S CV with \S\ = ln/2\ and 
\e{S) — ^1 > €c~'^n'^, where c is an absolute constant. 

The proof of Theorem 11.31 can be easily adjusted if we replace the "at most" with "at least" and 
the (1 — e) factor by (1 + e). Note that this theorem answers the original question of Chung and 
Graham in a very strong sense. 

1.2 Induced Ramsey numbers 

Recall that the induced Ramsey number rind(-ff) is the minimum n for which there is a graph G with 
n vertices such that for every 2-edge-coloring of G, one can find an induced copy of -ff in G whose 
edges are monochromatic. One of the fundamental results in graph Ramsey theory (see chapter 9.3 
of [IZ]), the Induced Ramsey Theorem, says that rind(-f^) exists for every graph H. Rodl [l5] noted 
that a relatively simple proof of the theorem follows from a simple application of his result discussed 
in the previous section. However, all of the early proofs of the Induced Ramsey Theorem give poor 
upper bounds on rind(-ff)- 

Since these early proofs, there has been a considerable amount of research on induced Ramsey 
numbers. Erdos [21j conjectured that there is a constant c such that every graph H on k vertices 
satisfies rind(^^) < 2^^^^. Erdos and Hajnal [2U] proved that r[^^{H) < 2^* holds for every graph 
H on k vertices. Kohayakawa, Promel, and Rodl [36l improved this bound substantially and showed 
that if a graph H has k vertices and chromatic number then rind(-f^) ^ A;^'=(i°sx)^ where c is a 
universal constant. In particular, their result implies an upper bound of 2'^'^('°s'^)^ on the induced 
Ramsey number of any graph on k vertices. In their proof, the graph G which gives this bound is 
randomly constructed using projective planes. 

There are several known results that provide upper bounds on induced Ramsey numbers for sparse 
graphs. For example, Beck [8] studied the case when H is a tree; Maxell, Kohayakawa, and Luczak |35j 
proved that the cycle of length k has induced Ramsey number linear in k; and, settling a conjecture 
of Trotter, Luczak and Rodl [ID] showed that the induced Ramsey number of a graph with bounded 
degree is at most polynomial in the number of its vertices. More precisely, they proved that for every 
integer d, there is a constant q such that every graph H on k vertices and maximum degree at most 
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d satisfies ri^d{H) < /c^'*. Their proof, which also uses random graphs, gives an upper bound on Cd 
that is a tower of 2's of height proportional to (f. 

As noted by Schaefer and Shah [l6], all known proofs of the Induced Ramsey Theorem either rely 
on taking G to be an appropriately chosen random graph or give a poor upper bound on ri^d{H). 
However, often in combinatorics, explicit constructions are desirable in addition to existence proofs 
given by the probabilistic method. For example, one of the most famous such problems was posed 
by Erdos [5j, who asked for the explicit construction of a graph on n vertices without a complete or 
empty subgraph of order clogn. Over the years, this intriguing problem and its bipartite variant has 
drawn a lot of attention by various researches (see, e.g., [301 HI El [9l [7] ) , but, despite these efforts, it is 
still open. Similarly, one would like to have an explicit construction for the Induced Ramsey Theorem. 
We obtain such a construction using pseudo-random graphs. 

The random graph G{n,p) is the probability space of all labeled graphs on n vertices, where every 
edge appears randomly and independently with probability p. An important property of G{n,p) is 
that, with high probability, between any two large subsets of vertices A and B, the edge density 
d{A, B) = is approximately p. This observation is one of the motivations for the following 

useful definition. A graph G = {V,E) is {p, X) -pseudo-random if the following inequality holds for all 
subsets A,B CV: 



It is easy to show that if p < 0.99, then with high probability, the random graph G{n,p) is (p, A)- 
pseudo-random with A = 0{^/pn). Moreover, there are also many explicit constructions of pseudo- 
random graphs which can be obtained using the following fact. Let Ai > A2 > . • • > An be the 
eigenvalues of the adjacency matrix of a graph G. An (n, d, \)-graph is a d-regular graph on n vertices 
with A = maxj>2 |Aj|. It was proved by Alon (see, e.g., [5], [38]) that every (n, d, A)-graph is in fact 
(^, A)-pseudo-random. Therefore to construct good pseudo-random graphs we need regular graphs 
with A <C d. For more details on pseudo-random graphs, including many constructions, we refer the 
interested reader to the recent survey [38]. 




A graph is d- degenerate if every subgraph of it has a vertex of degree at most d. The degeneracy 
number of a graph H is the smallest d such that H is d-degenerate. This quantity, which is always 
bounded by the maximum degree of the graph, is a natural measure of its sparseness. In particular, 
in a d-degenerate graph every subset X spans at most d\X\ edges. The chromatic number x{H) of 
graph H is the minimum number of colors needed to color vertices of H such that adjacent vertices get 
different colors. Using a greedy coloring, it is easy to show that d-degenerate graphs have chromatic 
number at most d-\-l. The following theorem, which is special case of a more general result which we 
prove in Section 4, shows that any sufficiently pseudo-random graph of appropriate density has strong 
induced Ramsey properties. 

Theorem 1.4 There is an absolute constant c such that for all integers k,d,x ^ 2, every (^,n'^'^)- 
pseudo-random graph G on n > k'^'^^°s>i vertices satisfies that every d-degenerate graph on k vertices 
with chromatic number at most x occurs as an induced monochromatic copy in all 2 -edge- colorings of 
G. Moreover, all of these induced monochromatic copies can be found in the same color. 

This theorem implies that, with high probability, G{n,p) with p = 1/k and n > k'^'^^°sx satisfies 
that every d-degenerate graph on k vertices with chromatic number at most x occurs as an induced 
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monochromatic copy in all 2-edge-colorings of G. It gives the first polynomial upper bound on the 
induced Ramsey numbers of d-degenerate graphs. In particular, for bounded degree graphs this is a 
significant improvement of the above mentioned Luczak-Rodl result. It shows that the exponent of the 
polynomial in their theorem can be taken to be 0{dlogd), instead of the previous bound of a tower 
of 2's of height proportional to cP. 

Corollary 1.5 There is an absolute constant c such that every d- degenerate graph H on k vertices 
with chromatic number X ^ 2 has induced Ramsey number rind{H) < k'^'^^"^^. 

A significant additional benefit of Theorem II. 41 is that it leads to explicit constructions for induced 
Ramsey numbers. One such example can be obtained from a construction of Delsarte and Goethals 
and also of Turyn (see [38]). Let r be a prime power and let G be a graph whose vertices are the 
elements of the two dimensional vector space over finite field F,., so G has vertices. Partition the 
r + 1 lines through the origin of the space into two sets P and N , where |P| = t. Two vertices x and y 
of the graph G are adjacent if a; — y is parallel to a line in P. This graph is known to be t{r — l)-regular 
with eigenvalues, besides the largest one, being either —t or r — t. Taking t = , we obtain an 

(n, d, A)-graph with n = r'^, d = n/k, and X = r — t<r< n^/^. This gives a (p, A)-pseudo-random 
graph with p = d/n = 1/k and A < n^/^ which satisfies the assertion of Theorem 11.41 

Another well-known explicit construction is the Paley graph P„. Let n be a prime power which is 
congruent to 1 modulo 4 so that —1 is a square in the finite field F„. The Paley graph P^ has vertex 
set F„ and distinct elements x, y G F„ are adjacent if x — y is a square. It is well known and not 
difficult to prove that the Paley graph Pn is (1/2, A)-pseudo-random with A = y/n. This can be used 
together with the generalization of Theorem 11.41 which we discuss in Section 5, to prove the following 
result. 

Corollary 1.6 There is an absolute constant c such that for prime n > 2^^^^°^^^ , every graph on k 
vertices occurs as an induced monochromatic copy in all 2-edge-colorings of the Paley graph P„. 

This explicit construction matches the best known upper bound on induced Ramsey numbers of 
graphs on k vertices obtained by Kohayakawa, Promel, and Rodl [36]. Similarly, we can prove that 
there is a constant c such that, with high probability, G(n, 1/2) with n > 2'^'^^°^ ^ satisfies that every 
graph on k vertices occurs as an induced monochromatic copy in all 2-edge-colorings of G. 

Very little is known about lower bounds for induced Ramsey numbers beyond the fact that an 
induced Ramsey number is at least its corresponding Ramsey number. A well-known conjecture of 
Burr and Erdos [11] from 1973 states that for each positive integer d there is a constant c{d) such that 
the Ramsey number r{H) is at most c{d)k for every d-degenerate graph H on k vertices. As mentioned 
earlier, Haxell et al. [35] proved that the induced Ramsey number for the cycle on k vertices is linear 
in k. This implies that the induced Ramsey number for the path on k vertices is also linear in k. 
Also, using a star with 2fc — 1 edges, it is trivial to see that the induced Ramsey number of a star 
with k edges is 2k. It is natural to ask whether the Burr-Erdos conjecture extends to induced Ramsey 
numbers. The following result shows that this fails already for trees, which are 1-degenerate graphs. 

Theorem 1.7 For every c > and sufficiently large integer k there is a tree T on k vertices such that 
rind(T) > ck. 
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The tree T in the above theorem can be taken to be any sufficiently large tree that contains 
a matching of linear size and a star of linear size as subgraphs. It is interesting that the induced 
Ramsey number for a path on k vertices or a star on k vertices is linear in k, but the induced Ramsey 
number for a tree which contains both a path on k vertices and a star on k vertices is superlinear in k. 

Organization of the paper. In the next section we give short proofs of Theorem 1 1 . 1 1 and Theorem 
11.21 which illustrate our methods. Section [3] contains the key lemma that is used as a replacement for 
Szemeredi's regularity lemma in the proofs of several results. We answer questions of Chung-Graham 
and Nikiforov on the edge distribution in graphs with a forbidden induced subgraph in Section HI In 
Section [5] we show that any sufficiently pseudo-random graph of appropriate density has strong induced 
Ramsey properties. Combined with known examples of pseudo-random graphs, this leads to explicit 
constructions which match and improve the best known estimates for induced Ramsey numbers. The 
proof of the result that there are trees whose induced Ramsey number is superlinear in the number 
of vertices is in Section [H The last section of this paper contains some concluding remarks together 
with a discussion of a few conjectures and open problems. Throughout the paper, we systematically 
omit floor and ceiling signs whenever they are not crucial for the sake of clarity of presentation. We 
also do not make any serious attempt to optimize absolute constants in our statements and proofs. 

2 Ramsey-type results for H-free graphs 

In this section, we prove Theorems 11.11 and 11.21 While we obtain more general results later in the 
paper, the purpose of this section is to illustrate on simple examples the main ideas and techniques 
that we will use in our proofs. Our theorems strengthen and generalize results from [l5] and [42j and 
the proofs we present here are shorter and simpler than the original ones. We start with the proof of 
Theorem II. H which uses the following lemma of Erdos and Hajnal p3]. We prove a generalization of 
this lemma in Section HI 

Lemma 2.1 For each e G (0,1/2), graph H on k vertices, and H-free graph G = iV^E) on n > 2 
vertices, there are disjoint subsets A and B of V with \A\, \B\ > such that either every vertex 

in A has at most e\B\ neighbors in B, or every vertex in A has at least (1 — e)\B\ neighbors in B. 

Actually, the statement of the lemma in [23] is a bit weaker than that of Lemma 12.11 but it is easy 
to get the above statement by analyzing more carefully the proof of Erdos and Hajnal. Lemma l2.ll 
roughly says that every H-free graph contains two large disjoint vertex subsets such that the edge 
density between them is either very small or very large. However, to prove Theorem I l.H we need to 
find a large induced subgraph with such edge density. Our next lemma shows how one can iterate the 
bipartite density result of Lemma 12.11 in order to establish the complete density result of Theorem 11.11 

For ei, 62 G (0, 1) and a graph H, define 5(ei, 62, H) to be the largest 6 (which may be 0) such that 
for each H-iree graph on n vertices, there is an induced subgraph on at least 6n vertices with edge 
density at most ei or at least 1 — £2. Notice that for 2 < no < ni, the edge-density of a graph on ni 
vertices is the average of the edge-densities of the induced subgraphs on no vertices. Therefore, from 
definition of 5, it follows that for every 2 < no < 6{€i,e2, H)n and H-iree graph G on n vertices, G 
contains an induced subgraph on exactly no vertices with edge density at most ei or at least 1 — €2- 
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Recall that the edge-density d{A) of a subset A of G equals e(A)/('^'), where e{A) is the number of 
edges spanned by A. 

Lemma 2.2 Suppose ei,e2 G (0,1) with ei + €2 < 1 and H is a graph on k > 2 vertices. Let 
e = min(ei, £2). We have 

S{eue2,H) > {e/4)''k~'mm (5(3ei/2, es, i^) , 5(ei, 3e2/2, i/)) . 

Proof. Let G be a H-fiee graph on n > 2 vertices. If n < A; then we may consider any two- vertex 
induced subgraph of G which has always density either or 1. Therefore, for G of order less than k 
we can take 6 = 2/k, which is clearly larger than the right hand side of the inequality in the assertion 
of the lemma. Thus we can assume that n > k. Applying Lemma |2. II to G with e/4 in place of e, we 
find two subsets A and B with \A\, \B\ > {e/4)^~^n/k, such that either every vertex in A is adjacent 
to at most j\B\ vertices of B or every vertex of A is adjacent to at least (1 — j)\B\ vertices of B. 

Consider the first case in which every vertex in A is adjacent to at most j\B\ vertices of B (the 
other case can be treated similarly) and let G[A] be the subgraph of G induced by the set A. By 
definition of function 6, G[A] contains a subset A' with 

J ^<<5(3ei/2, £2,^)1^1, 

such that the subgraph induced by A' has edge density at most |ei or at least 1 — €2- If A' has 
edge density at least 1 — e2 we are done, since G[A'] is an induced subgraph of G with at least 
(e/4)'^/c~-^(5(3ei/2, 62, -ff)n vertices and edge density at least 1 — 62. So we may assume that the edge 
density in A' is at most |ei. 

Let Bi <Z B he those vertices of B that have at most ||^'| neighbors in A' . Since A' C A, each 
vertex of A' has at most j\B\ neighbors in B and the number of edges e{A' ,B) < Therefore 
Bi has at least |5|/2 vertices. Then, by definition of 5, Bi contains a subset B' with 

^) ^<5{?,ei/2,e2,H)\Bil 

such that the induced subgraph G[B'] has edge density at most |ei or at least 1 — €2- If it has edge 
density at least 1 — £2 we are done, so we may assume that the edge density d{B') is at most |ei. 

Finally to complete the proof note that, since \A'\ = \B'\, \A' U B'\ = 2\A'\, d{A'),d{B') < |ei, 
and d{A' , B') < we have that 

e{A'uB') = e(yl')+e(S') + e(A',B0<^6i('f')+^ei('f' 
= 2ei\A'\'^ -3ei\A'\/2 <€i 

Therefore, d{A' U B') < ei. □ 

From this lemma, the proof of our first result, that every H-free graph on n vertices contains a 
subset of at least 2"*^^^^'°^ n vertices with edge density either < e or > 1 — e, follows in a few lines. 
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Proof of Theorem ll.lt Notice that if ei + 62 > 1, then triviaUy d{ei,e2, H) = 1. In particular, if 
^1^2 > |, then ei + €2 > 1 and S{ei, e2,H) = 1. Therefore, by iterating Lemma [2^2] for t = log ^/ log | 
iterations and using that e < 1/2, we obtain 

5{e,e,H) > {^-^^ > 2-isiiV2('=(i°gi/^)'+(2fc+iogfc)iogiA) ^ 2-^5kiiogi/e)2 ^ 

which, by definition of 5, completes the proof of the theorem. □ 
Recall the Erdos-Szemeredi theorem, which states that there is an absolute constant c such that 
every graph G on n vertices with edge density e € (0, 1/2) has a homogeneous set of size at least ■ 
Theorem 11.21 follows from a simple application of Theorem 11.11 and the Erdos-Szemeredi theorem. 
Proof of Theorem II. 2t Let G be a graph on n vertices which is not /c-universal, i.e., it is H-fiee 

_ 1 I log 

for some fixed graph H on k vertices. Fix e = 2 sV fe and apply Theorem 11.11 to G. It implies that 
G contains a subset W C V{G) of size at least 2""'^^'^^^°^ n = r?l^ such that the subgraph induced 
by W has edge density at most e or at least 1 — e. Applying the Erdos-Szemeredi theorem to the 
induced subgraph G\W\ or its complement and using that elogl/e < 46^/2 for all e < 1, we obtain a 
homogeneous subset W C W with 

iTi^/i ^ clogn^/^ clogn ^ c 1 / log" 

\W'\> > ^ > — 210V logn, 

' ' ~ elogi - lOei/2 - 10 ^ ' 

which completes the proof of Theorem 11.21 □ 



3 Key Lemma 

In this section we present our key lemma. We use it as a replacement for Szemeredi's regularity lemma 
in the proofs of several Ramsey-type results, thereby giving much better estimates. A very special 
case of this statement was essentially proved in Lemma 12.21 in the previous section. Our key lemma 
generalizes the result of Graham, Rodl, and Rucinski [34J and has a simpler proof than the one in |34j . 
Roughly, our result says that if (Gi, . . . , G-r) is a sequence of graphs on the same vertex set Y with 
the property that every large subset of V contains a pair of large disjoint sets with small edge density 
between them in at least one of the graphs Gj, then every large subset of V contains a large set with 
small edge density in one of the Gj. To formalize this concept, we need a couple definitions. 

For a graph G = (F, E) and disjoint subsets W\, . . . ,Wt C V, the density dciWi, . . . , Wt) between 
the t > 2 vertex subsets Wi , ... ,Wt is defined by 



dciWi, . . . ,Wt[ 

If \Wi\ = ... = \Wt\, then 



E^<jm\w,\ 



dciWu . . . , = Q E ^g(1^., Wj). 



Also, in this section if t = 1 we define the density to be zero. 
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Definition 3.1 For a,p,€ £ [0,1] and positive integer t, a sequence {Gi, . . . ,Gr) of graphs on the 
same vertex set V is (a, p,e,t)-sparse if for all subsets U C V with \U\ > a\V\, there are positive 
integers ti,...,tr such that 111=1 ^« — ^ and for each i £ [r] = {l,...,r} there are disjoint subsets 
W,,i, Wi,u C U with |VFi,i| = . . . = \Wi,t^ = \p\U\] and dcAW^^i, Wi,u) < e. 

We call a graph (a, p, e, t)-sparse if the one-term sequence consisting of that graph is (a, p, e, t)- 
sparse. By averaging, if a' > a, p' < p, e' > e, t' < t, and {Gi, . . . ,Gr) is (a, /?, e, i)-sparse, then 
(Gi, . . . , Gr) is also (a', p' , e' , t')-sparse. The following is our main result in this section. 

Lemma 3.2 // a sequence of graphs {Gi, . . . ,Gr) with common vertex set V is {^ap, p' , e,t)-sparse 
and {a, p,e/4:,2)-sparse, then [Gi, . . . ,Gr) is also {a, ^pp' , e,2t)-sparse. 

Proof. Since (Gi, . . . , Gr) is (a, p, e/4, 2)-sparse, then for each U C V with \U\ > a\V\, there is z G [r] 
and disjoint subsets X,Y C U with \X\ = \Y\ = p\U\ and dG,{X,Y) < e/4. Let Xi be the set of 
vertices in X that have at most ||y| neighbors in 1" in graph Gj. Then eGi{X\Xi,Y) > e|X\Xi||y|/2 
and we also have eGi{X,Y) < e\X\\Y\/4:. Therefore \Xi\ > \X\/2 > ^p\U\ and by removing extra 
vertices we assume that \Xi\ = ^p\U\. 

Since {Gi,...,Gr) is (^ap, p', e, i)-sparse, then there are positive integers ti,...,tr such that 
0^=1 *i — * for each j £ [r] there are disjoint subsets Xj^i, . . . , Xj^t^ C Xi of size \Xj^i\ = 
. . . = \Xj^tj I = p'\Xi I with density dc^ [Xj^i, . . . , Xj^tj ) < £• Let Yi the set of vertices in Y that have at 
most e|Xj^iU. . .UXj^tJ neighbors in Xi^iU. . .UXi^a in graph Gi. Since every vertex of Xi is adjacent to 
at most vertices of y and since Xi^iU . . .UXj^^. C Xi we have that ddiXi^iU . . .UXj^t.,y) < e/2. 
On the other hand, d^iXi^i U . . . U Xi^a, Y \ Yi) > £• Therefore |yi| > |y|/2, so again we can assume 
that |yi| = \p\U\ = \Xi\. Since is (^ap, p', e, t)-sparse, then there are positive integers 

si, . . . , Sr such that 11^=1 — * each j £ [r] there are disjoint subsets Yj^i, . . . , Yj^sj C Yi with 

dc^ (y^- 1, . . . , Yj^s,) < e and \Yj,i\ = ... = \Yj,s,\ = p'\Yi\. 

By the above construction, the edge density between Xn U . . . U Xj and Yn U . . . U Yi^^^ is 
bounded from above by e. We also have that both dc^iXi^i, . . . , Xi^tJ and dg'. (yj_i, . . . , 1^.^.) are at 
most e and these two sets have the same size. Therefore dc^^Xi^i, . . . , Xi^ti,Yi,i, ■ ■ ■ , < implying 
that (Gi, . . . , Gr) is (a, ^pp', e, n) -sparse with u = {ti + Si) Y[j(^[r]\{i} max(tj, sj) for some i. By the 
arithmetic mean-geometric mean inequality, we have 

r r j 

t^< 

i=i i=i Vi6[r]\{i} 

Thus u > 2t. Altogether this shows that (Gi, . . . ,Gr) is (a, ^pp' , e,2t) -sparse, completing the proof. 
□ 

Rather than using this lemma directly, in applications we usually need the following two corollaries. 
The first one is obtained by simply applying Lemma 13.21 h — 1 times. 

Corollary 3.3 // (Gi, . . . , Gr) is {a, p, e/4, 2)-sparse and h is a positive integer, then (Gi, . . . , Gr) is 
also 2^-^p'', e, 2^*) -sparse. 



(ti + Si)^'' - 4 • 
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If we use the last statement with h = r log ^ and a = {^)^ ^, then we get that there is an index 
i G [r] and disjoint subsets Wi, . . . ,Wt C V with t > 2^1'' = \^\Wx\ = . . . = \Wt\ = 2}-^p^\V\, and 
d-Gi^Xi ■ ■ ■ i^t) < £• Since (''^^') < f(*'^^')) even if every Wi has edge density one, still the edge 
density in the set W\ U . . . U VFj is at most 2e. Therefore, (using e/2 instead of e) we have the following 
corollary. 

Corollary 3.4 If {Gi, . . . ,Gr) is {{^)^^^ , p, e/8,2)-sparse where h = rlog^, then there is i £ [r] and 
an induced subgraph G' of Gi on 2e~^2^~^ p^\V\ vertices that has edge density at most e. 

The key lemma in the paper of Graham, Rodl, and Rucinski [M] on the Ramsey number of graphs 
(their Lemma 1) is essentially the r = 1 case of Corollarv 13. 4[ 

4 Edge distribution in H-free graphs 

In this section, we obtain several results on the edge distribution of graphs with a forbidden induced 
subgraph which answer open questions by Nikiforov and Chung-Graham. We first prove a strength- 
ening of Rodl's theorem (mentioned in the introduction) without using the regularity lemma. Then 
we present a proof of Theorem 11.31 on the dependence of error terms in quasirandom properties. We 
conclude this section with an upper bound on the maximum edge discrepancy in subgraphs of H-fiee 
graphs. To obtain these results we need the following generalization of Lemma |2. II 

Lemma 4.1 Let H be a k-vertex graph and let G be a graph on n > k'^ vertices that contains less 
than n^{l — ^)Y[i=ii^ ~ ^i)^i~^ labeled induced copies of H, where eo = 1 and ei,6i G (0,1) for 
all 1 < i < k — 1. Then there is an index i < k — 1 and disjoint subsets A and B of G with 
\A\ > ^^^'"^^ ^j<i ^3 ^^'^ 1^1 — f nj<i such that either every vertex of A is adjacent to at most ei\B\ 
vertices of B or every vertex of A is adjacent to at least (1 — ei)|-B| vertices of B. 

Proof. Let M denote the number of labeled induced copies of H in G, which by our assumption is 
at most 

M<n''(i-^\ n(i - (1) 

^ ^ i=l 

We may assume that the vertex set of H is [k] . Consider a random partition Vi U . . . U Vfc of the vertices 
of G such that each Vi has cardinality n/k. Note that for any such partition there are (n/k)^ ordered 
/c-tuples of vertices of G with the property that the i-th vertex of the /c-tuple is in Vi for all i £ [k]. On 
the other hand the total number of ordered /c-tuples of vertices is n(n — 1) • • • (n — /c + 1) and each of 
these /c-tuples has the above property with equal probability. This implies that for any given /c-tuple 
the probability that its i-th vertex is in Vi for all i E [k] equals HiLi n-^i+i ■ particular, by linearity 
of expectation, the expected number of labeled induced copies of if in G for which the image of every 
vertex i £ [k] is in Vi is at most M ■ HiLi n-^i+i • Using that Y\{1 — Xj) > 1 — ^ Xj for any < Xj < 1 
and that n > A;^, we obtain 
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k 



fc-1 / fc-1 



) 



-1 



1 



n 



1=1 



n — i + 1 



n/k 




n 



= k 




) 



< 



This, together with ([T]), shows that there is a partition Vi U . . . U Vfe of G into sets of cardinahty n/k 
such that the total number of labeled induced copies of -ff in G for which the image of every vertex 
i £ [k] is in Vi is less than 



We use this estimate to construct sets A and B which satisfy the assertion of the lemma. For a 
vertex v € V, the neighborhood N(v) is the set of vertices of G that are adjacent to v. For v £ Vi and 
a subset S C Vj with i / j, let N{v, S) = N{v) n 5 if is an edge of H and N{v, S) = S \ N{v) 
otherwise. We will try iteratively to build many induced copies of H. After i steps, we will have 
vertices vi, . . . ,Vi with Vj G Vj for j < i and subsets K+i^,, Vi+2,i, ■ ■ ■ , Vk,i such that 

1. Ve^i is a subset of of size \Ve^i\ > j Y[j=i foi' all i + I < £ < k, 

2. for 1 < j < £ < i, {vj,vi) is an edge of G if and only if {j, I) is an edge of H, 

3. and if j < i < i and w £ Vg^i, then (vj,w) is an edge of G if and only if (j, i) is an edge of H. 

In the first step, we call a vertex v £ Vi good if |A^(f,l/i)| > ei|Vi| for each i > 1. If less than 
a fraction 1 — of the vertices in Vi are good, then, by the pigeonhole principle, there is a subset 
AcVi with 1^1 > -^l^il = -^(^^n and an index j > 1 such that \N{v, Vj)\ < ei\Vj\ for each v £ A. 
Letting B = Vj, one can easily check that A and B satisfy the assertion of the lemma. Hence, we may 
assume that at least a fraction 1 — (5i of the vertices vi £ V\ are good, choose any good v\ and define 
Vi^\ = N{vi, V) for i > 1, completing the first step. 

Suppose that after step i the properties 1-3 are satisfied. Then, in step i + we again call a vertex 
V £ Vi+i^i goodif\N{v,Vj^i)\ > ei+i\Vj^i\ for each j > i+1. If less than a fraction 1— vertices of V^+i^j 
are good, then, by the pigeonhole principle, there is a subset A C T^i+i,i with l^l > ^T^irjlFj+i^il and 
index j > i + 1 such that \N{v, V^-,j)| < ej+i|l^-^j| for each v £ A. Letting, B = Vj^i, one can check using 
properties 1-3, that A and B satisfy the assertion of the lemma. Hence, we may assume that a fraction 
1 — (5j+i of the vertices Vi+i £ V^+i^j are good, choose any good Wj+i and define T^',i+i = N{vi+i,Vj^i) 
for j > i + 2, completing step i + 1. Notice that after step i + 1, we have |V^',i+i| > ej+i|V^-^j| for 
j > i + 1, which guarantees that property 1 is satisfied. The remaining properties (2 and 3) follow 
from our construction of sets V^',i+i. 

Thus if our process fails in one of the first k — 1 steps we obtain desired sets A and B. Suppose now 
that we successfully performed k — 1 steps. Note that in step i + 1, we had at least (1 — (5i+i)|yj+i^i| > 
^{1 — Si+i) Y\j=i vertices to choose for vertex Uj+i. Also note that, by property 3, after step k — 1 




i=l 



(2) 
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we can choose any vertex in the set Vk^k-i to be Vk- Moreover, by the property 2, every choice of the 
vertices vi, . . . ,Vk form a labeled induced copy of H. Altogether, this gives at least 

4=1 ^ 0<j<i ^ j=l i=l ^ 0<j<i ^ 

= {n/k)'^ll{l-6,)et' 

i=l 

labeled induced copies of H for which the image of every vertex i E [k] is in T^. This contradicts ([2]) 
and completes the proof. □ 

Notice that the number of induced copies of H in any induced subgraph of G is at most the number 
of induced copies of H in G. Let = e < 1/2 and Si = for 1 < i < k — 1 and let a > k'^/n. Applying 
Lemma |4. II with these ei,5i to subsets of G of size an and using that > e''^^, 1 — > 1/2 we 
obtain the following corollary. 

Corollary 4.2 Let H be a graph with k vertices, a > k'^/n, e < 1/2, and G be a graph with at most 
2-'^e(2)(an)'= induced copies of H . Then the pair (G, G) is (a, 4fc^,e,2) -sparse. 

The next statement strengthens Theorem 1 1.1 1 bv allowing for many induced copies of -ff. It follows 
from Corollary 13.41 with r = 2,h = 21og(2/e), /3 = ^2k~^ combined with the last statement in which we 
set a = {p/2)^~K 

Corollary 4.3 There is a constant c such that for each e G (0, 1/2) and graph H on k vertices, every 
graph G on n vertices with less than 2"'^^'^'°^"^ induced copies of H contains an induced subgraph 
of size at least 2~^^^^°^~'^ n with edge density at most e or at least 1 — e. 

This result demonstrates that for each e G (0,1/2) and graph H, there exist positive constants 
S* = 5*{e, H) and k* = K*{e, H) such that every graph G = {V, E) on n vertices with less than k* 
induced copies of H contains a subset W <ZV oi size at least 5* n such that the edge density of W is at 
most e or at least 1 — e. Furthermore, there is a constant c such that we can take 5* (e, H) = 2"'^'^^'°^ 
and K*{e,H) = 2~'^^^^°^~^ . Applying Corollary |T3] recursively one can obtain an equitable partition 
of G into a small number of subsets each with low or high density. 

Theorem 4.4 For each e £ (0,1/2) and graph H on k vertices, there are positive constants k = 
K{e,H) and G = C{e,H) such that every graph G = {V,E) on n vertices with less than ku^ induced 
copies of H, there is an equitable partition V = Ui=i such that I < G and the edge density in each 
Vi is at most e or at least 1 — e. 

This extension of Rodl's theorem was proved by Nikiforov [41] using the regularity lemma and therefore 
it had quite poor (tower like) dependence of n and C on e and k. Obtaining a proof without using the 
regularity lemma was the main open problem raised in [41] . 

Proof of Theorem 1131 Let K{e,H) = {%fK*{^,H) and G{e,H) = 4/(e5* (|, iJ)), where k* and 5* 
were defined above. Take a subset Wi C of size (5*(|,-ff)|n whose edge density is at most | or at 
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least 1 — |, and set Ui = V\Wi. For j > 1, if \Uj\ > |n, then by definition of k we have that the number 
of induced copies of H in Uj is at most (the number of such copies in G) nn^ = {^)^k* < K*\Uj\^ . 
Therefore by definition of k* and 5* we can find a subset Wj+i C Uj of size (5*|n < 5*|f7j | whose edge 
density is at most | or at least 1 — |, and set f/j+i = Uj \ W^j+i- 

Once this process stops, we have disjoint sets . . . , W^, each with the same cardinality, and a 
subset Un of cardinality at most |n. The number d. is at most 

n/\W,\<A/{e5*{\,H)). 

Partition set C/^ into £ equal parts Ti, . . . ,T£ and let Vj = Wj U Tj for 1 < j < i- Notice that 
y = Fi U . . . U is an equitable partition of V. By definition, \Tj\ = \Ui\/i < On the other 

hand \Wj\ = {n - \Ui\)/l > (1 - e/4)n/£. Since 1 - e/4 > 7/8, this implies that 

|T,|<|n/^< J(l-e/4)-V,|<||W^,|. 

We next look at the edge density in Vj. If the edge density in Wj is at most e/4, then using the above 
bound on \Tj\, it is easy to check that the number of edges in Vj is at most 

Hence, the edge density in each such Vj is at most e. Similarly, if the edge density in Wj is at least 
1 — |, then the edge density in Vj is at least 1 — e. This completes the proof. □ 

We next use Lemma 14.11 to prove that there is a constant c > such that every graph G on n 
vertices which contains at most (1 — e)2~^'^n^ labeled induced copies of some fixed fe-vertex graph H 
has a subset S of size |5| = [n/2j with \e{S) — fg | > ec~^n^ . 

Proof of Theorem [mi For 1 < i < A; - 1, let = \{l - 2'-''-'^e) and 6i = 2^-^~'^e. Notice that for 
alH < A: - 1 

JJ ej = 2-^+1 n (1 ~ 2^"''"^e) > 2"^+^ (l - e > 2-*+i(l - e/8) > 2"* (3) 



and also that 



h—\ k 

= 2-(2)[](l-2^-'=-2e)'-^+^ = 2-(2)]J(l-e2-^-i) 

i=l i=l j=2 

k 



> 



V j=2 ^ 



We may assume that e > A:^/n since otherwise by choosing constant c large enough we get that 
ec~^nP' < 1 and the conclusion of the theorem follows easily. Therefore 



(-i:)na-*.)4--(i ' ' 

^ ' i=l 



2-(2) > (l-e)2-(2). 



2 
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IS an 



and we can apply Lemma 14.11 with Ci and 6i as above to our graph G since it contains at most 
(1 - e)2-(2) n labeled induced copies of H. This lemma, together with ([3]), implies that there 
index i < k — 1 and disjoint subsets A and B with 



\A\> 



k{k 



\{e^>k-^2 



2r,-k-2 



n. 



j<i 



j<i 



and every element of A is adjacent to at most ei\B\ elements of B or every element of A is adjacent 
to at least (1 — ei)|-B| elements of B. In either case, we have 



e{A,B)-^\A\\B\ 



1 



> I - - ei ) \A\\B\ = 2'~''~^e\A\\B\ > k-''^2~ "en". 



Note that 



e{A,B) - -\A\\B\ = e{AuB) 



AUB\ 
2 



e{A) 



l(\A\ 
2\ 2 



e{B) 



U\B\ 
2\ 2 



It follows from the triangle inequality that there is some subset of vertices R £ {A,B,A U B} such 
that 

e(i?)_ifl^l^ >lrt-2'=-W, (4) 



i.e., it deviates by at least ek'^2^'^^~^n? /3 edges from having edge density 1/2. To finish the proof 
we will use the lemma of Erdos et al. [22], mentioned in the introduction. This lemma says that if 
graph G on n vertices with edge density r] has a subset that deviates by D edges from having edge 
density rj, then it also has a subset of size n/2 that deviates by at least D/5 edges from having edge 
density rj. Note that if the edge density of our graph G is either larger than l/2 + eA;^'^2~^'^~^n^/30 or 
smaller than 1/2 — e/c~^2~^'^^^n^/30 than by averaging over all subsets of size n/2 we will find subset 
S satisfying our assertion. Otherwise, if the edge density t] of G satisfies \r]— 1/2| < ek^^2^'^^^^n'^ /30, 
then the subset R from dH) deviates by at least ek-^2-^^-^n^ /3 - ek-^2-^''-^n^ /30 > ek-^2^'^''n^ /4 
edges from having edge density r]. Then, by the lemma of Erdos et al., G has a subset S of cardinality 
n/2 that deviates by at least ek^^2^'^^~^n'^ /20 edges from having edge density rj. This 5 satisfies 



e{S)-\\S\' 



n Uk--^2 



-2k^2 



□ 



completing the proof. 

For positive integers k and n, recall that D{k,n) denotes the largest integer such that every 
graph G on n vertices that is H-fiee for some fc- vertex graph H contains a subset S of size n/2 with 
|e(S') — jqIt-"^] > D{k,n). We end this section by proving the upper bound on D{k,n). 

Proposition 4.5 There is a constant c > such that for all positive integers k and n > 2/^1"^ , there 
is a K^-free graph G on n vertices such that for every subset S of n/2 vertices of G, 



e{S) - —n^ 
^ ' 16 



< 



c2~^'^n' 



16 



Proof. Consider the random graph 1/2) with I = 2'^/^. For every subset of vertices X in this 
graph the number of edges in X is a binomially distributed random variable with expectation l^'^KI^I"-^^ . 
Therefore by Chernoff's bound (see, e.g., Appendix A in [5]), the probabihty that it deviates from this 
value by t is at most 2e~* /'"'^l . Thus choosing t = 1.51^^"^ we obtain that the probability that there 
is a subset of vertices X such that |e(X) — ^^^^^'^^ — ^| > t is at most 2^ • 2e~*^/^^ <C 1. This implies 
that there is graph F on ^ vertices such that every subset X of F satisfies 

e(X)-^|X|2| < 2^3/2. (5) 

Let G be the graph obtained by replacing every vertex u of F with an independent set lu, of size 
n/£, and by replacing every edge {u,v) of F with a complete bipartite graph, whose partition classes 
are independent sets and ly. Clearly, since F does not contain K/^, then neither does G. We claim 
that graph G satisfies the assertion of the proposition. Suppose for contradiction that there is a subset 
S of n/2 vertices of G satisfying 
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(the other case when e{S) - n^/lQ < -At 

1/2^2 pg^j^ treated similarly). For every vertex u S F 
let the size of H /„ be aun/i. By definition, < a„ < 1 and since S has size n/2 we have that 
J2u ~ We also have that 

(u,v)eE(r) 

and therefore 

J2 auay > f/16 + 4^3/2 = 1 ( J] a„) ' + 41'/'. 
(u,v)eE{r) u 

Consider a random subset y of F obtained by choosing every vertex u randomly and independently 
with probability a„. Since all choices were independent we have that 

We also have that the expected number of edges spanned by Y is E[e(y)] = Yl(u v)eE{r) '^uCLv- Then, 
by the above discussion, IE[e(y) — |y|2/4] > 3£^/2, particular, there is subset y of F with this 
property, which contradicts ([5]). This shows that every subset S oi n/2 vertices of G satisfies 



and completes the proof. □ 



5 Induced Ramsey Numbers and Pseudorandom Graphs 

The main result in this section is Theorem 15.41 which shows that any sufficiently pseudo-random 
graph of appropriate density has strong induced Ramsey properties. It generalizes Theorem 11.41 and 
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Corollary 11.61 from the introduction. Combined with known examples of pseudo-random graphs, this 
theorem gives various explicit constructions which match and improve the best known estimates for 
induced Ramsey numbers. 

The idea of the proof of Theorem 15. 41 is rather simple. We have a sufficiently large, pseudo-random 
graph G that is not too sparse or dense. We also have d-degenerate graphs Hi and H2 each with 
vertex set [k] and chromatic number at most q. We suppose for contradiction that there is a red-blue 
edge-coloring of G without an induced red copy of Hi and without an induced blue copy of H2. We 
may view the red-blue coloring of G as a red-blue-green edge-coloring of the complete graph K^q^, in 
which the edges of G have their original color, and the edges of the complement G are colored green. 
The fact that in G there is no induced red copy of Hi means that the red-blue-green coloring of i^|G| 
does not contain a particular red-green coloring of the the complete graph Ki^. Then we prove, similar 
to Lemma |2. II of Erdos and Hajnal, that any large subset of vertices of G contains two large disjoint 
subsets for which the edge density in color red between them is small. By using the key lemma from 
Section 3, we find k large disjoint vertex subsets Vi, . . . ,Vk of G for which the edge density in color 
red is small between any pair (V^, Vj) for which an edge of H2. 

Next we try to find an induced blue copy of H2 with vertex i in Vi for all i £ [k]. Since the 
edge density between Vi and Vj in color red is sufficiently small for every edge of H2, we can 

build an induced blue copy of H2 one vertex at a time. At each step of this process we use pseudo- 
randomness of G to make sure that the existing possible subsets for not yet embedded vertices of H2 
are sufficiently large and that the density of red edges does not increase a lot between any pair of 
subsets corresponding to adjacent vertices of H2. This last part of the proof, embedding an induced 
blue copy of H2, is the most technically involved and handled by Lemma |5.5[ 

Recall that [i] = {1, . . . , i} and that a graph is d-degenerate if every subgraph has a vertex of degree 
at most d. For an edge-coloring ^' : E{Kk) [r], we say that another edge-coloring $ : E{Kn) — > \s\ 
is ^-free if, for every subset W of size k of the complete graph Kn, the restriction of $ to is not 
isomorphic to In the following lemma, we have a coloring ^ of the edges of the complete graph 
with colors 1 and 2 such that the graph of color 2 is d-degenerate. We also have a ^'-free coloring <I> 
of the edges of the complete graph Kn such that between any two large subsets of vertices there are 
sufficiently many edges of color 1. With these assumptions, we show that there are two large subsets 
of Kn which in coloring <I> have few edges of color 2 between them. A graph G is bi-{e,S)-dense if 
d{A,B) > e holds for all disjoint subsets A, B c V{G) with \A\,\B\ > 6\V{G)\. 

Lemma 5.1 Let d and k be positive integers and ^ : E{Kk) [2] be a 2-coloring of the edges of 
K]. such that the graph of color 2 is d-degenerate. Suppose that (7, e € (0, 1) and <5 : E{Kn) — > [s] is 
a "if-free edge-coloring such that the graph of color 1 is bi-{q,e'^q^k~'^)-dense. Then there are disjoint 
subsets A and B of Kn with \A\, \B\ > e'^q^k~'^n such that every vertex of A is connected to at most 
€\B\ vertices in B by edges of color 2. 

Proof. Note that from definition, the vertices of every d-degenerate graph can be labeled 1,2,... 
such that for every vertex £ the number of vertices j < i adjacent to it is at most d. (Indeed, remove 
from the graph a vertex of minimum degree, place it in the end of the list and repeat this process in 
the remaining subgraph.) Therefore we may assume that the labeling 1, . . . , fc of vertices of K^ has 
the property that for every i £ [k] there are at most d vertices j < i such that the color ^(j, = 2. 
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Partition the vertices of Kn into sets Vi U . . . U each of size ^ . For w £ Vi and a subset S cVj with 
j ^ i, let N{w,S) = {s £ S \ s) = j)}. For i < i, let D{i,i) denote the number of vertices 
j < i such that the color ^'(j, i) = 2. By the above assumption, D{1, i) < d for 1 < i < i < k. 

We will try iteratively to build a copy of with coloring ^. After i steps, we either find two 
disjoint subsets of vertices A, B which satisfy the assertion of the lemma or we will have vertices 
ui, . . . , f j and subsets ^j+i^i, Vi+2,i-, ■ ■ ■ , Vk,i such that 

1. Ve^i is a subset of Ve of size \Vi^i\ > e^(^.Og»-^(«.i)|lA^| for alH + 1 < ^ < k, 

2. ^{vj,ve) = fovl<j <£<i, 

3. and if j < i < £ and w G V^^j, then ^{vj,w) = 

In the first step, we call a vertex w £ Vi good if \N{w, Vj)\ > €\Vj\ for all j > 1 with ^(1, j) = 2 
and \N{w,Vj)\ > q\Vi\ for all j > 1 with j) = 1. If there is no good vertex in Vi, then there is 
a subset A C Vi with |^| > ^pjlVil and index j > I such that either ^'(1, j) = 1 and every vertex 
w G A has fewer than q\Vj\ edges of color 1 to Vj or \l'(l,j) = 2 and every vertex w £ A is connected 
to less than e|V^| vertices in Vj by edges of color 2. Letting B = Vj, we conclude that the first case 
is impossible since the graph of color 1 is hi-{q, e'^g'^A;~^)-dense, while in the second case we would be 
done, since A and B would satisfy the assertion of the lemma. Therefore, we may assume that there 
is a good vertex vi G Vi, and we define Vi^i = N{vi, Vi) for i > 1. 

Suppose that after step i the properties 1-3 are still satisfied. Then, in step i + a vertex w £ Vi+i^i 
is called good if \N{w, Vj,i)\ > e|Vj-,i| for each j > i + 1 with + 1, j) = 2 and \N{w, V^-,j)| > Q\Vj,i\ for 
each j > i + 1 with + 1, j) = 1. If there is no good vertex in T^j+i^j, then there is a subset A C V^+i^j 
with \A\ > j^_\_i \Vi-^-i^i\ and j > i + 1 such that either ^'(z + l,j) = 1 and every vertex w £ A has 
fewer than q\Vj^i\ edges of color 1 to Vj^i or ^'(l,j) = 2 and every vertex w £ A is connected to less 
than e|Vj-,j| vertices in Vj^i by edges of color 2. Note that even in the last step when i + 1 = k the size of 
A is stih at least \Vk,k-i\/k > e'^q^\Vk\/k > e'^q^k-'^n. Therefore, letting B = Vj^i, we conclude that as 
before the first case is impossible since the graph of color 1 is bi-(g, e'^(7'^A;^^)-dense, while the second 
case would complete the proof, since A and B would satisfy the assertion of the lemma. Hence, we 
may assume that there is a good vertex Vi+i £ and we define = N{vi+i, Vj^i) for j > i + 1. 

Note that \Vj^i+i\ > q\Vj^i\ if ^{i + l,j) = 1 and \Vj,i+i\ > e\Vj^i\ if ^{i + 1, j) = 2. This implies that 
after step i + 1 we have that |V^,i+i| > e^5('^.»+i)g»+i-^(Ai+i)| for alH + 2 < £ < k. 

The iterative process must stop at one of the steps j < k — 1, since otherwise the coloring $ would 
not be ^'-free. As we already explained above, when this happens we have two disjoint subsets A and 
B that satisfy the assertion of the lemma. □ 

Notice that if coloring <I> : Kn — > [s] is ^-free, then so is ^ restricted to any subset of of size 
an. Therefore, Lemma lS.ll has the following corollary. 

Corollary 5.2 Let d and k be positive integers and ^ : E{Kk) — > [2] he a 2- coloring of the edges of 
Kk such that the graph of color 2 is d-degenerate. If q,a,e £ (0,1) and <I> : E{Kn) [s] is a ^-free 
edge-coloring such that the graph of color 1 is bi-{q,ap)-dense with p = e'^q^k~^, then the graph of 
color 2 is (a, p, e, 2) -sparse. 

The next statement follows immediately from Corollary 15.21 (with e /4 instead of e) and Corollary 13.31 



19 



Corollary 5.3 Let d, k, and h be positive integers and ^ : E{Kk) [2] be a 2-coloring of the edges 
of such that the graph of color 2 is d-degenerate. Suppose that q,a,e £ (0, 1) and ^ : E{Kn) — > [s\ 
is a '^-free edge-coloring such that the graph of color 1 is bi-{q,ap) -dense with p = {e/ A^q^k^'^ . Then 
the graph of color 2 is {{j^)^~^a,2^~^p^,e,2^)-sparse. 

Pending one additional lemma, we are now ready to prove the main result of this section, showing 
that pseudo-random graphs have strong induced Ramsey properties. 

Theorem 5.4 Let x ^ 2 and G be a (p, X)-pseudo-random graph with < p < 3/4 and A < 
((lUfc)'^^^^^)^'^^"^'^"" ^^^^ every d-degenerate graph on k vertices with chromatic number at most 
X occurs as an induced monochromatic copy in every 2-coloring of the edges of G. Moreover, all of 
these induced monochromatic copies can be found in the same color. 

Taking p = 1/k, n = k""^^"^^ and constant c sufficiently large so that {{-^f2-P^f^^°^^ > n'^-'^ 
one can easily see that this result implies Theorem 11.41 To obtain Corollary 11.61 recall that for a 
prime power the Paley graph 7^ has vertex set IF^ and distinct vertices x, ?/ G IF^ are adjacent if 
X — y is a square. This graph is (1/2, A)-pseudo-random with A = ^/n (see e.g., [38]). Therefore, for 
sufficiently large constant c, the above theorem with n = 2'^'^^°^ p = 1/2 and d = x = ^ implies 
that every graph on k vertices occurs as an induced monochromatic copy in all 2-edge-colorings of 
the Paley graph. Similarly, one can prove that there is a constant c such that, with high probability, 
the random graph G{n, 1/2) with n > 2^^^°^^ satisfies that every graph on k vertices occurs as an 
induced monochromatic copy in all 2-edge-colorings of G. 

Proof of Theorem 15.41 Suppose for contradiction that there is an edge-coloring <I>o of G with colors 
red and blue, and d-degenerate graphs Hi and H2 each having k vertices and chromatic number at 
most X such that there is no induced red copy of Hi and no induced blue copy of H2 ■ Since Hi , H2 
are d-degenerate graphs on k vertices we may suppose that their vertex set is [k] and every vertex i 
has at most d neighbors less than i in both Hi and H2. 

Consider the red-blue-green edge-coloring $ of the complete graph Kn, in which the edges of G 
have their original coloring $0) and the edges of the complement G are colored green. Let ^ be the 
edge-coloring of the complete graph K^. where the red edges form a copy of Hi and the remaining 
edges are green. By assumption, the coloring $ is ^'-free. Since G is (p, A)-pseudo-random, we have 
that the density of edges in G between any two disjoint sets A,B of size at least Qp~^X is at least 

daiA, B) = l- dciA, B)>l-(p+ -/==j ^ ^ " ^^'^ 

Therefore the green graph in coloring $ is bi-(g, 6p~^^)-dense for q = 1 — 7p/6. 

^ ^ ToTOfc^' P ^ (e/4)''Q^fc~^, h = logx, and a = {p/2)^~'^. Using that q = 1 - 7p/6 and 
A/n < ((^)'^2-P^)20i°sx it is straightforward to check that 6p''^^ < 2^-^'p^ = ap. By Corollary [O] 
and Definition 13.11 there are 2^ = x subsets Wi, . . . , of with \ Wi\ = ... = \W^\ > 2^~^p^n, 
such that the sum of densities of red edges between all pairs Wi and Wj is at most i^e^- Hence, the 
density between Wi and Wj is also at most x^e for all 1 < z < j < x- Partition every set Wi into k 
subsets each of size \Wi\/k > ^2^~^ p^n. Since the chromatic number of H2 is at most x and it has 
k vertices, we can choose for every vertex i of H2 one of these subsets, which we call Vi, such that 
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all subsets corresponding to vertices of H2 in the same color class (of a proper x-coloring) come from 
the same set W^. In particular, for every edge (i, j) of H2., the corresponding sets Vi and Vj lie in two 
different sets {VF^}. Since the size of ViS is by a factor k smaller than the size of W^'s the density of 
red edges between Vi and Vj corresponding to an edge in H2 is at most k'^'X^e < ^qq^^^ (note that it 
can increase by a factor at most k"^ compare to density between sets {W^}). Notice that the subgraph 
G' C G induced by Vi U . . . U Vfc has n' > 2^~^p^n vertices and is also {p, A)-pseudo-random. By the 
definitions of p and h, and our assumption on A, we have that 

// 71 \d \201ogx // „ \d 

V«'<2-V-A/„<2-V-((4)2-') <((4)2-') . 

Applying Lemma 15.51 below with H = H2 to the coloring <I>o of graph G' with partition Vi U . . . U V^, 
we find an induced blue copy of completing the proof. □ 

Lemma 5.5 Let H he a d-degenerate graph with vertex set [k] such that each vertex i has at most d 
neighbors less than i. Let G = [V, E) he a {p, X) -pseudo-random graph on n vertices with < p < 3/4, 
A < ((^)'^2~P'^)^''n and let V = ViU . . .UVk he a partition of its vertices such that each Vi has size 
njk. Suppose that the edges of G are 2-colored, red and hlue, such that for every edge of H, the 
density of red edges between the pair {Vj, Vi) is at most (3 = jq^^^- Then there is an induced blue copy 
of H in G for which the image of every vertex i G [k] lies in Vi. 

Proof. For i < j, let D{i,j) denote the number of neighbors of j that are at most i. Let ^1 = ^, 
£2 = and 6 = {I -p^p'^. Since p < 3/4, notice that 5 > 2~^P''p'^ and 

X<((^Y2-A'\<^4^5'n. (6) 

We construct an induced blue copy of H one vertex at a time. At the end of step i, we will have 
vertices vi, . . . ,Vi and subsets Vj^i C Vj for j > i such that the following four conditions hold 

1. for J, ^ < i, if (j, i) is an edge of H, then {vj,Vi) is a blue edge of G, otherwise Vj and vi are not 
adjacent in G, 

2. for j < i < i, if (j, i) is an edge of H, then Vj is adjacent to all vertices in Vi,i by blue edges, 
otherwise there are no edges of G from Vj to Vi^i, 

3. for i < j, we have \ Vj^i\ > (l - p - €2y-^^''^\p - €2)^^''^^\Vj\, 

4. and for j,£ > i if {j, £) is an edge of H, then the density of red edges between Vj^i and Vi^i is at 
most (1 + eiYP. 

Clearly, in the end of the first k steps of this process we obtain a required copy of H. For i = 
and j G [k] , define Vj^o = Vj . Notice that the above four properties are satisfied for i = (the first two 
properties being vacuously satisfied). We now assume that the above four properties are satisfied at 
the end of step i, and show how to complete step i + 1 by finding a vertex Vi+i S V^+i^j and subsets 
Vj^i^i C Vj^i for j > i -\- 1 such that the conditions 1-4 still hold. 

We need to introduce some notation. For a vertex w & Vj and a subset S C V^ with j / i, let 
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• N{w,S) denote the set of vertices s E S such that {s,w) is an edge of G, 

• B{w, S) denote the set of vertices s e S such that {s,w) is a blue edge of G, 

• R{'w, S) denote the set of vertices s e S such that (3,111) is a red edge of G, 

• N{w, S) = N{w, S) if is an edge of H and N{w, S) = S \ N{w, S) otherwise, 

• B{w, S) = B{w, S) if {j,£) is an edge of H and B{w, S) := S \ N{w, S) otherwise, and 

• Pj,e = P if (j, £) is an edge of H and pj^e = 1 — ^? if {j, t) is not an edge of H. 

Note that since graph G is pseudo-random with edges density p, by the above definitions, for every 
large subset S C and for most vertices w ^Vj we expect the size of N{w, S) to be roughly Pj/\S\. 

We also have for ah 5 C and G Vj that B{w, S) = N{w, S) \ R{w, S). 

Call a vertex w G good if for all j > i + 1, B(w, Vj^i) > (pi+ij — e2)|^',i| and for every edge 

of H with j,i > i + 1, the density of red edges between B{w,Vj_i) and B{w,Ve_i) is at most 
(1 + eiy~^^p. If we find a good vertex w G V^+i^j, then we simply let fj+i = w and = B{w, Vj^i) 

for _7 > i + 1, completing step i + It therefore suffices to show that there is a good vertex in V^+i^j. 

We first throw out some vertices of ensuring that the remaining vertices satisfy the first of 

the two properties of good vertices. For j > i + 1 and an edge (i + 1, j) of H, let Rj consist of those 
w G for which the number of red edges {w,Wj) with Wj G Vj^i is at least ^l^'i*!- Since the 

density of red between Vi+i,i and Vj^i is at most (1 + ei)*/3, then Rj contains at most 

IP I ^ (1 + _o/i I , M-ia\^r I 

2 I I 

vertices. Let V' be the set of vertices in V^+i^j that are not in any of the Rj. Using that ei = £2 = 
TM and /? = we obtain 



\v'\ > \Vi+i,i\ - J2 ^ - ^(2(1 + ^iy^2^m+i,i\) 

j>i+l 

> (1 - 2k{l + ei)'=e2 > 



For J > i + 1, let Sj consist of those w £ V for which N(w, Vj^i) < (pi+ij — y)!^,?!- Then the 
density of edges of G between Sj and Vj^i deviates from p by at least ^- Since graph G is (p, X)- 
pseudo-random, we obtain that ^ ^ /i ^ n and hence \Sj\ < . . Also using that p < 3/4 

we have 1 — p — €2 = 1 — p — ^ > (1 — — P)- Therefore, our third condition, combined with 

(5 = (1 — and (1 — x)* > 1 — xt for all < x < 1, imply that ioi j > i + 1 



\Vj,i\ > {I - p - e2y-''^''^\p - e2)''^'^''^\Vj\ > {I - p - e2)Hp - ^2f\Vji 

k 



> (1-^) (l-T^l (I-P)VIVSI 
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Since A < j^j^n (see ([6])) and €2 = we therefore have \Sj\ < ^^^^ ^ < j^\Vi-^.i^i\. Let V" be the 
set of vertices in V' that are not in any of the sets Sj. The cardinahty of V" is at least 

\V"\ > \V'\ - > \V'\ - k . {^\V^+l,i\) > \V'\ - ^\Vi+i,\ > 

j>i+l 

Moreover, by definition, for every j > i + 1 and every vertex w £ V" there are \R{w, Vj^i)\ < yI^jA 
edges from w to Vj^i if (i + is an edge of H and also N{w, Vj^i) has size at least {pi+ij — y)!^',*!- 
This implies that 

\B{w,Vj,i)\ = \N{w,Vj,i) \ R{w,Vj,i)\ > \N{w,Vj,i)\ - ||V,-i| > {pi+i,j - e2)\Vj,i\ 

and therefore the vertices of V" satisfy the first of the two properties of good vertices. 

We have reduced our goal to showing that there is an element of V" that has the second property 
of good vertices. For i + l<j<i<k and (j, I) an edge of H, let Tj^i denote the set of u; G V" such 
that the density of red edges between B{'w, Vj^i) and B{w, Vi^i) is more than (1 + ei)*+^/3. Notice that 
any vertex of V" not in any of the sets Tj^i is good. Therefore, if we show that Tj^£ < ^-^^ for each 
Tj £, then there is a good vertex in V" and the proof would be complete. To do so we will assume 
without loss of generality that Pi+i,j and pi+i/ are both p (the other 3 cases can be treated similarly 
using the fact that G is (1 — p, A)-pseudo-random). Since by ((Tj) we have that | V^^ j | , | j | > and 

\'V"\ 1 I I 

-£r- > 4p-|Vi4.i,i| > gp-, the result follows from the following claim. 

Claim 5.6 Let X,Y and Z be three disjoint subsets of our (p, \) -pseudo-random graph G such that 
\X\ > ^ and \Y\,\Z\ > For every w £ X let Bi{w),B2{w) be the set of vertices in Y and Z 
respectively connected to w by a blue edge and suppose that \Bi{w)\ > {p — xofc)|^| O'nd \B2{w)\ > 
(p — ■^)\Z\. Also suppose that the density of red edges between Y and Z is at most rj for some 
r] > ■ Then there is a vertex w £ X such that the density of red edges between Bi{w) and B2{w) 

is at most ^-^r]- 

Proof. Let m denote the number of triangles {x,y,z) with x£X,y£Y,z£Z, such that the edge 
(y, z) is red. We need an upper bound on m. Let U be the set of vertices in Y that have fewer than 
p'^5'^(Wk)~^^n red edges to Z. So the number mi of triangles {x,y,z) which have y G U and edge 
(y, z) red is clearly at most mi < p'^6'^{Wk)~^^n^ . Let Wi,W2 denote the subsets of vertices in Y 
whose number of neighbors in X is at least {p+ -^)\X\ or respectively at most {p— 20fc)l^l- Since the 
density of edges between Wi and X deviates from p by more than using {p, A)-pseudo-randomness 
of G, we have ^ < -^7j=p=^, or equivalently, \X\\Wi\ < AOOk'^p^^X'^. Therefore, using the upper 

bound A < (x(j)e)io from the number 77x2 of triangles (x, y, z) with y gW = Wi U W2 and edge 
(y, z) red is at most 

m2 < \X\\W\n < Smk^p-'^X^n < {lQk)~^^p'^5'^n^ . 

For y Y \{U W), we have the number of neighbors of y in X satisfy — p| ^ 20A; 

and the number of red edges from y to Z is at least p^5^{lQk)~'^^n. Recall that R{y,Z) denotes the 
set of vertices in Z connected to y by red edges, hence we have that \R{y,Z)\ > p^6^{10k)~^^n for 
every y e Y\{UUW). We also have that \N{y,X)\ > p\X\/2 > Since G is (p, A)-pseudo- 

random, we can bound the number of edges between N{y,X) and R{y,Z) by p\N{y, X)\\R{y, Z)\ + 
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X^y\I^njj^X)\\R(jj^Z) \ . Using the above lower bounds on \N{y, X)\ and \R{y, Z)\, and the upper bound 
([6|) for A, one can easily check that 

A X p 



y^\N{y,X)\\R{y,Z)\ ^ [p5n / {IQk^)) {p^5'^{l{)k)-^^n) 
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Hence the number of edges between A^(y, X) and Z) is at most {p+-^)\N{y, X)\\R(y, Z)\. Recall 
that for aW y e Y \ {U U W) we have that \N{y, X)\ < [p + ^) \X\. Also, since the density of red 
edges between Y and Z is at most r], we have that \Riy, Z)\ < rj\Y\\Z\. Therefore, the number 
of triangles (x, y, z) with y £ Y \ {U U W),x £ X, z £ Z such that the edge (y, z) is red is at most 

^ \Niy,X)my,Z)\ < + Z)| < + Vll^^ll^l- 

y&Y\{UUW) y 

Using the lower bounds on |X|, |Z|, ry from the assertion of the claim we have that 



p^r7|X||y||Z| > ^ " > (10A;)^max(mi,m2). 

This implies that the total number of triangles {x,y,z) with x G X,y £ Y, z £ Z , such that the edge 
(y, z) is red is at most 

^ ^F^V\X\\Y\\Z\ f p ^2 
m = m, + m, + ms<2^^^^ + (p+—)rj\X\\Y\\Z\ 

< {l + l/{8k))p'^7]\X\\Y\\Z\. 

Therefore, there is vertex w £ X such that the number of these triangles through w is at most 
(1 + l/{8k))p^T]\Y\\Z\. Since Bi{w) C N{w,Y) and B2{w) C N{w, Z), then the number of red edges 
between i?i (tu) andi?2('U^) is at most {l + l/{8k))p'^r]\Y\\Z\. Since we have that |i?i(tt;)| > (p— iofc)|^| 
and |S2(iL')| > ip — lofc)!-^!) the density of red edges between Bi{w) and 62(10) can be at most 

il + l/i8k))p^r]\Y\\Z\ ^ (1 + l/(8fc)y7? ^ k+J_ 
\B,{w)\\B2{w)\ - {p-^,? - k 

completing the proof. □ 



6 Trees with superlinear induced Ramsey numbers 

In this section we prove Theorem 1 1.7( that there are trees whose induced Ramsey number is superlinear 
in the number of vertices. The proof uses Szemeredi's regularity lemma, which we mentioned in the 
introduction. 

A red-blue edge-coloring of the edges of a graph partitions the graph into two monochromatic sub- 
graphs, the red graph, which contains all vertices and all red edges, and the blue graph, which contains 
all vertices and all blue edges. The weak induced Ramsey number r^eak ind(-f^i) -^^2), introduced by 
Gorgol and Luczak [33], is the least positive integer n such that there is a graph G on n vertices such 
that for every red-blue coloring of the edges of G, either the red graph contains Hi as an induced 
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subgraph or the blue graph contains H2 as an induced subgraph. Note that this definition is a relax- 
ation of the induced Ramsey numbers since we allow blue edges between the vertices of red copy of 
Hi or red edges between the vertices of blue copy of H2. Therefore a weak induced Ramsey number 
lies between the usual Ramsey number and the induced Ramsey number. Using this new notion we 
can strengthen Theorem 11.71 as follows. Recall that Ki^k denotes a star with k edges. 

Theorem 6.1 For each a G (0,1), there is a constant k{a) such that if H is a graph on k > k{a) 
vertices with maximum independent set of size less than (1 — a)k, then r^eak ind{H, Ki^k) > ^• 

Let r be a tree which is a union of path of length k/2 with the star of size k/2 such that the end point 
of the path is the center of the star. Since T contains the path Pk/2 and the star as induced 

subgraphs, then rind(T') > 

'"weak ind -f^i,fe/2)- By using the above theorem with k/2 instead of k, 

H = and sufficiently small a, we obtain that r\^f]^[T)/k 00. Moreover the same holds for every 
sufficiently large tree which contains a star and a matching of linear size as subgraphs. We deduce 
Theorem 16.11 from the following lemma. 

Lemma 6.2 For each 5 > there is a constant > such that if G = {V, E) is a graph on n 
vertices, then there is a 2- coloring of the edges of G with colors red and blue such that the red graph 
has maximum degree less than 5n and for every subset W C V, either there are at least c^v? blue edges 
in the subgraph induced by W or there is an independent set in W in the blue graph of cardinality at 
least \W\ — 5n. 

Proof. Let e = j^q. By Szemeredi's regularity lemma, there is a positive integer M(e) together with 
an equitable partition V = [J^^i Vi of vertices of the graph G = {V, E) into k parts with ^ < fc < M(e) 
such that all but at most efc^ of the pairs iVi^Vj) are e-regular. Recall that a partition is equitable if 
\\yi\ — iVjll < 1 and a pair {Vi, Vj) is called e-regular if for every X C Vi and Y C Vj with \X\ > e\Vi\ 
and \Y\ > €\Vj\, we have \d{X,Y) - d(yi,Vj)\ < e. Let cs = eM{e)-'^. Notice that to prove Lemma 
16.21 it suffices to prove it under the assumption that n is sufficiently large. So we may assume that 
n > e-^M(e). 

If a pair {Vi, Vj) is e-regular with density diyi, Vj) at least 2e, then color the edges between Vi and 
Vj blue. Let G' be the subgraph of G formed by deleting the edges of G that are already colored blue. 
Let V be the vertices of G' of degree at least 5n. Color blue any edge of G' with a vertex in V . The 
remaining edges are colored red. First notice that every vertex has red degree less than 5n. 

We next show that \V'\ is small by showing that G' has few edges. There are at most 




edges {v,w) of G with v and w both in the same set Vi. Since at most eA;^ of the pairs (Vi,V^) are 
not e-regular, then there are at most en^ edges in such pairs. The e-regular pairs {yi,Vj) with density 
less than 2e contain at most a fraction 2e of all possible edges on n vertices. So there are less than 
en^ edges of this type. Therefore the number of edges of G' is at most 3en^, and therefore there are 
at most < 2 ^^F ^ < Qe5~^n < Iry vertices of degree at least 5n in it. 
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LetW CV. Let W = W\V', so W has cardinality at least \W\-^. LetWi = VinW'. Let 
= U|VFi|>e|yi| Notice that for any i e [A;] there are at most vertices in {W' \ W") n Vi, so 
there are at most k{e'^) = en = vertices in W' \ W" . Therefore, W" has at least \W\ — 5n vertices. 
If there are « / j such that |Wj|, \ Wj\ > ej and the pair {Vi,Vj) is e-regular with density at least 2e, 
then there are at least 

e|Wi||Ty,-| > -^n^ > eM{e)-'^n^ = c&r? 

blue edges between Wi and Wj. In this case the blue subgraph induced by W has at least ci,r? edges. 
Otherwise, all the edges in W" are red, and W" is an independent set in the blue graph of cardinality 
at least — bn. □ 

Proof of Theorem 16.11 Let i/ be a graph on k vertices with maximum independent set of size less 
than (1 — OL)k. Take b = o? and C5 to be as in Lemma [6. 2 [ Let G = (y, E) be any graph on n vertices, 
where n If H has at least cgk'^ edges, consider a random red-blue coloring of the edges of G such 
that the probability of an edge being red is ^. The expected degree of a vertex in the red graph is 
at most an/2. Therefore by the standard Chernoff bound for the Binomial distribution it is easy to 
see that with probability 1 — o(l) the degree of every vertex in the red graph is less than an < k, i.e., 
it contains no Ki ^. On the other hand, for k sufficiently large, the probability that the blue graph 
contains a copy of H is at most 

^^(1 _ a/2y^-^'^ < nk^-acsk'^/2 ^ g-acaA;2/2+fclog(fc/a) ^ ^^^-j^ 

Thus with high probability this coloring has no blue copy of H as well. This implies that we can 
assume that the number of edges in H is less than c^/c^. 

By Lemma 16.21 there is a red-blue edge-coloring of the edges of G such that the red graph has 
maximum degree at most 5n and every subset W C V contains either an independent set in the 
blue graph of size at least \W\ — 6n or contains at least c^n^ blue edges. Since 6n = o?n < k, 
then the red graph does not contain Ki^k as a subgraph. Suppose for contradiction that there is 
an induced copy of H in the blue graph, and let W be the vertex set of this copy. The blue graph 
induced by W has e{H) < csk"^ < c^n^ edges. Therefore it contains an independent set of size at least 
\W\ —5n> \W\ —ak = {l — a)k, contradicting the fact that H has no independent set of size (1 — a)k. 
Therefore, there are no induced copies of H in the blue graph. □ 

7 Concluding Remarks 

• All of the results in this paper concerning induced subgraphs can be extended to many colors. 
One such multicolor result was already proved in Section [5] (see Lemma 15. ip , and we use here 
the notation from that section. For example, one can obtain the following generalization of 
Theorem ll.il For A; > 2, let 'J' : E{Kk) [r] be an edge-coloring of the complete graph and 
$ : E{Kn) [s] be a ^'-free edge-coloring of the complete graph Kn- Then there is a constant c 
so that for every e G (0, 1/2), there is a subset W C Kn of size at least 2~'^^^^^°^'^^ n and a color 
i G [r] such that the edge density of color i in is at most e. Since the proofs of this statement 
and other generalizations can be obtained using our key lemma in essentially the same way as 
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the proofs of the results that we aheady presented (which correspond to the two color case), we 
do not include them here. 

• It would be very interesting to get a better estimate in Theorem 11.11 This will immediately 
give an improvement of the best known result for Erdos-Hajnal conjecture on the size of the 
maximum homogeneous set in H-bee graphs. We believe that our bound can be strengthened 
as follows. 

Conjecture 7.1 For each graph H, there is a constant c{H) such that if e £ (0, 1/2) and G is 
a H-free graph on n vertices, then there is an induced subgraph of G on at least e'^^^^n vertices 
that has edge density either at most e or at least 1 — e. 

1 

This conjecture if true would imply the Erdos-Hajnal conjecture. Indeed, take e = n c(h7+t_ 

Then every H-fiee graph G on n vertices contains an induced subgraph on at least e^^^^n = 
1 

^c(H)+i vertices that has edge density at most e or at least 1 — e. Note that this induced 
subgraph or its complement has average degree at most 1 , which implies that it contains a clique 
or independent set of size at least 2^''*^'"''^ • 

• One of the main remaining open problems on induced Ramsey numbers is a beautiful conjecture 
of Erdos which states that there exists a positive constant c such that rind(-ff) < 2^^^^ for every 
graph H on k vertices. This, if true, will show that induced Ramsey numbers in the worst case 
have the same order of magnitude as ordinary Ramsey numbers. Our results here suggest that 
one can attack this problem by studying 2-edge-colorings of a random graph with edge probability 
1/2. It looks very plausible that for sufficiently large constant c, with high probability random 
graph G(n, 1/2) with n > 2'^^ has the property that any of its 2-edge-colorings contains every 
graph on k vertices as an induced monochromatic subgraph. Moreover, maybe this is even true 
for every sufficiently pseudo-random graph with edge density 1/2. 

• The results on induced Ramsey numbers of sparse graphs naturally lead to the following ques- 
tions. What is the asymptotic behavior of the maximum of induced Ramsey numbers over all 
trees on k vertices? We have proved r^^i^iT) is super linear in k for some trees T. On the other 
hand. Beck [8] proved that ri^d{T) = O [k"^ log^ k) for all trees T on k vertices. 

For induced Ramsey numbers of bounded degree graphs, we proved a polynomial upper bound 
with exponent which is nearly linear in the maximum degree. Can this be improved further, e.g., 
is it true that the induced Ramsey number of every n-vertex graph with maximum degree d is 
at most a polynomial in n with exponent independent of d? It is known that the usual Ramsey 
numbers of bounded degree graphs are linear in the number of vertices. 
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